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Abstract 



Arikan's polar coding method is extended to two-user multiple-access channels. It is shown that if the two users of the channel 
use the Ankan construction, the resulting channels will polarize to one of five possible extremals, on each of which uncoded 
transmission is optimal. The sum rate achieved by this coding technique is the one that correponds to uniform input distributions. 
The encoding and decoding complexities and the error performance of these codes are as in the single-user case: O(nlogn) for 
encoding and decoding, and o(exp(— n 1//2_E )) for block error probability, where n is the block length. 



Polar coding, invented by Ankan [1], is a technique for achieving the 'symmetric capacity' of binary input, memoryless 
channels. The underlying principle of the technique is to convert repeated uses of a given single-user channel to single uses 
of a set of extremal channels — almost every channel in the set is either almost perfect, or almost useless. Ankan calls this 
phenomenon polarization. In this note we describe a way to extend this technique to multiple-access channels (MACs). 

One way to do this extension is via the 'rate splitting/onion peeling' scheme of [2], [3]. In Appendix A, we describe how 
arbitrary points in the capacity region of a given MAC can be achieved using polar codes and rate splitting techniques. 

The approach taken in here is different, partly because our motivation is to see whether multiple-access channels polarize 
in the same way as single-user channels do. In the following, we will describe a technique to 'polarize' a given two-user 
multiple-access channel in the same sense as in [1], i.e., we will convert repeated uses of this MAC into single uses of 
extremal MACs. Whereas in the single user case there are only two extremal channels (perfect or useless), we will see that in 
the multiple-access case there will be five. 

The coding scheme that results from this construction shares some properties of the single-user case: the encoding and 
decoding complexity is O(nlogn), n being the block length, and the block enor probability is roughly 0(2 _v/ "). Also 
analogous to the single-user polar codes' achieving the 'symmetric capacity', codes for the multiple-access channel are capable 
of achieving some of the rate pairs on the dominant face of the rate region obtained with uniformly distributed inputs. 



Let P: X x W —> y be a two user multiple-access channel with input alphabets X = W — ¥ q = {0, 1, . . . , q — 1}, where q 
is a prime number. The output alphabet y may be arbitrary. The channel is specified by P(y\x,w), the conditional probability 
of each output symbol y E y for each possible input symbol pair (x, w) £ X x W. The capacity region of such a channel is 
given by 
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x,w 



where 



D\{W,X) ={(i?x,i? 2 ): < Ri < I(X;YW), 
< i? 2 < I{W;YX), 
Ri+R2< I(XW;Y)}, 



the union is over all random variables X G X, W £ W, and Y € y jointly distributed as 

Pxwy{x, w,y) =Px(x)p w (w)P(y | x,w), 

and co(S) denotes the convex hull of the set SQ. 

In this note, rather than the capacity region, we will be interested in the region 

3(P) := 9\(X, W) where X and W are uniformly distributed on ¥ q . 

Given such a channel P and independent random variables X, W uniformly distributed on F g , define 

7 (1) (P) := I(X;YW), I (2 \P) := I(W;YX), and I^- l2 \P) := I(XW;Y), 



and let 



K(P) := (I^{P),I^(P),I (12) (P)) e M 3 . 



'All logarithms in this note will be to the base q. This in particular implies that I(X; YW),I(W; YX) 6 [0, 1] and I(XW; Y) £ [0, 2]. 
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Note that the region 3{P) is defined by 

3(P) = {(R U R 2 ): < i?i < I (1) (P), 0<R 2 < I {2) {P), Ri + R 2 < I (12) (P)}. 

Further note that maxfj' 1 ', 1^} < /( 12 ) < /W + I^ 2 \ therefore the constraints that define 3(P) are polymatroidal. In 
particular, there exists (Ri,R 2 ) € 3(P) for which R\ + R 2 = I^ 12 \ The set of such points is called the dominant face of 
3{P). 

III. Polarization 

Two independent uses of P yields a multiple-access channel P 2 with input alphabets X 2 and W 2 , and output alphabet y 2 . 
Setting the inputs (X\,X 2 , W\,W 2 ) to be independent and uniformly distributed on ¥ q , and letting (Yi, Y 2 ) denote the output, 
the region 3{P 2 ) is described by the three quantities 

liX^Y^W^) = 2/«(P), 
IiW^Y^X^) = 2I<- 2 \P), and 
KX^WxW^YxY^ = 2I l - 12 \P) 

that upper bound R\, R 2 and R\+R 2 respectively. Now consider putting the pair (X\,X 2 ) e F 2 in one-to-one correspondence 
with (Ui,U 2 ) e¥ 2 via 

X x = U x + U 2 , X 2 = U 2 
and the pair (Wi, W 2 ) € V 2 in one-to-one correspondence with (Vi, V 2 ) G F 2 via 

w 1 = v 1 + v 2 , w 2 = v 2 , 

where both additions are modulo-g. Observe that {U\, U 2 ,V\, V 2 ) are also independent and uniformly distributed on ¥ q . Note 
further that 

2I^{P) = /(X^Y^WWs) (1) 
= I(U l XJ 2 \Y X Y 2 V X V 2 ) 
= IiUKYjYiVM) + HUxYiYiVMU!) 

> HUnYMV!) + IiUxYiYiVMU!), 

2/(2) ( P ) = I{W{W 2 ; Y Y Y 2 X X X 2 ) (2) 
= HVxV^Y^U^) 
= J(V i; Y X Y 2 U X U 2 ) + HVxYiTiUiUiV!) 

> I(Vi;YiY 2 Ui) + I(V 2 ;YiY 2 UiU 2 Vi), 

and 

2/(12) (p) = i (XiX 2 W\W 2 ; Y\Y 2 ) (3) 
= HJJxUiVxV^YxYi) 
= I{U 1 V 1 -Y 1 Y 2 ) + nUtVxYMUM). 

Observe that the quantities 

I(Ui;Y x Y2Vi), I{Vi;YiYiU x ), and I{U x Vx;Y x Y 2 ) 
are those that describe the region associated to the g-ary input multiple-access channel U\V\ — > Y{Y 2 , and the quantities 

IiU^Y^V^Ui), UYxYiYiUMV!), and /(I^Vfc; YilWiVi) 

are those that describe the region associated to the q-ary input multiple-access channel U 2 V 2 — > Y{Y 2 U{V\. This motivates the 
following. 

Definition 4: Suppose P: X x W ^ y is a two user multiple-access channel with input alphabet ¥ q . Define two new 
multiple-access channels, P~ : X x W ^ y x y and P+ : X x W ^ y x y x X x W as: 

P~{yi,V2\ui,Vi) = jsP(yiWi+u 2 ,v 1 +v 2 )P(y 2 \u 2 ,v 2 ), 
u 2 ex v 2 ew 

P + (y 1 ,y 2 ,u 1 ,v 1 \u 2 ,v 2 ) = ^P(yi\ui +u 2 ,v 1 + v 2 )P(y 2 \u 2 ,v 2 ). 
The channels P~ and P + correspond to U\V\ — > Y\Y 2 and U 2 V 2 — >■ Y\Y 2 U\V\ above, respectively. It is clear that the 
channel P~ can be synthesized from two independent uses of the channel P, whereas the channel P + in general cannot, since 



3 



at its output we require (Ui,Vi) in addition to (Y^Ya). However, P + can be synthesized from two uses of the channel P 
with the aid of a genie that delivers (Ui, V\) as side information to the output terminal. 

Note that the channel P~ is 'worse' and the channel P + is 'better' than the channel P in the sense that I^(P-) < 
jO)(p) < for each a G {1,2,12}. To see this, observe that if we process the output (2/1,2/2, wi> v%) of the channel 

P+ to keep only y 2 , the resulting channel is identical to P. Thus I^ a \P+) > I^(P). That /( Q )(p-) < I^ a \P) then follows 
from ([TJ, |2]) and (|3). Consequently, 

J(P~) C 3(P) C 3(P + ). 

Furthermore, by virtue of equations ([T|, Q and 



|J(P-) + | 



3(P+) C 3(P), 



where the left-hand side of the above denotes set sum, i.e. 



\b: ae3(p-),be3{P+)} 



\3{P + ) that are on 



\3{p-) + p{P + ) = {\u , 2 . 

Nevertheless, by the polymatroidal nature of (1^, I^ 2 \ I^ 12 -*) and by ([3]), there are points in ^3(P~ 
the dominant face of 3(P). 

We have now seen that from two independent copies of a q-ary input multiple-access channel P we can derive two g-ary 
input multiple-access channels P~ and P + . Applying the same process to P~ and P + , we can derive from four independent 
copies of P, four g-ary input multiple-access channels P := (P~)~, P h := (P~) + , P H := (P + ) _ and P ++ := (P + ) + . 
Recursively applying the process £ times results in 2 e q-aiy input multiple-access channels 



P- 



These channels have the property that the set 



,P H 



a(p s ) 



is a subset of 3(P), but contains points on the dominant face of 3(P). 

The main result reported in this section is that these derived channels polarize in the following sense: 

Theorem 5: Let P be a q-ary input multiple-access channel. Let M := {(0,0,0), (0,1,1), (1,0,1), (1,1,1), (1,1,2)} Cl 3 , 

and for p € R 3 , let dip, Mj := min^gM \\p — x\\ denote the distance from a point p to M. Then, for any S > 

Hm ^ #{ s e {".+}' : d (^( pS ) ; M ) > -5} = 0. 

That is, except for a vanishing fraction, the regions 3(P S ) approach one of five possible regions. 
Remark 6: The five limiting regions in Theorem [5] are the following. 




(000) 



R2 

i4 



(Oil) 



Ri 



(101) 



R-2 



L 

I -I 1— 



(ill) 



Ri 




(112) 



The first case, (000), is that of a channel whose output provides no useful information about any of its inputs; the second 
and third, (011) and (101), are channels that provide complete information about one of the inputs but nothing about the other; 
the fourth, (ill), is a pure contention channel; the last, (112), is one whose output determines both inputs perfectly. 

Theorem [5] will be proved as a corollary to Theorem [13] below. To prove the latter theorem, we need a few auxiliary results: 

Lemma 7 ([6]): For any e > 0, there is a 6 := 6(e) > such that if 
(i) Q : V q — > B is a q-ary input channel with arbitrary output alphabet £>, and 



4 



(ii) Ai, A 2 , B\, B 2 are random variables jointly distributed as 

PA 1 A 2 B 1 B 2 {ai,a2,bi,b 2 ) = ^rQ(&i|ffli + a 2 )Q(b 2 \a 2 ), and 

(iii) I(A 2] B X B 2 A X ) - I{A 2 ;B 2 ) < 5, 
then, 

I(A 2 ;B 2 )<£(e,l-e). 

Note that S can be chosen irrespective of the alphabet B. 

Corollary 8: For any e > there exists a 5 > such that if P is a two-user q-ary input multiple-access channel with 

7W(P+)- jW(p) <( 5 7 
then, 7 (1) (P) ^ (e, 1 - e). Similarly, if P is such that 

j( 2 )(P+)-/( 2 )(P) <S, 

then, 7< 2 )(P) £ (e, 1 - e). 

Proof: It suffices to prove the first claim. To that end, note that /W (P) = I(U 2 ; Y 2 V 2 ) and /W (P+) = I{U 2 ;Y x Y 2 UiVxV 2 ), 
where 

Pc/iVi^^i-ii^i^i^, "2,2/1, y 2 ) = ^"^(y 1 I Ul +"2,«i + v 2 )P(y 2 | it 2 ,u 2 ). 
We have by hypothesis that 

5>I{U 2 ;Y 1 Y 2 V 1 V 2 U 1 )-I{U 2 ;Y 2 V 2 ), 

It can easily be checked that the values of the above mutual informations remain unaltered if evaluated under the joint 
distribution 

qu 1 v 1 u 2 v 2 Y 1 Y 2 (ui,v 1 ,u 2 ,v 2 ,y 1 ,y 2 ) = ^P(yi | «i +u 2 ,v 1 )P(y 2 | u 2 ,v 2 ). 

Defining Ai = Ui, P; = (Yi, Vi) and Q(y,v\u) = ^P(y\u,v), one can then write 

qA 1 A 2 B 1 B 2 (a>i,a 2 ,b 1 ,b 2 ) = ^Q(&i | a x +a 2 )<5(6 2 | a 2 ). 

Applying Lemma [7] now yields the claim. ■ 
Lemma 9: For any e > there exists a 5 > such that whenever P is a two-user g-ary input multiple-access channel with 

j( 12 )(P+)-/( 12 )(P)<5, then 

I {12 \P) - 7«(P) i (e, 1 - e) for j = 1, 2. 
Proof: By symmetry, it suffices to prove the claim for j = 1. Choose S so that 5 < e and 5 < 5(e) of Lemma [7] Note 

that 

5> j(12)(p+)_ j(i 2 )(p) 

= /([/ 2 v 2 ;y 1 [/ 1 y 1 |y 2 ) 

= j(c/ 2 ;y x y 2 f/i) -/([/ 2 ;F 2 ). 

Applying Lemma pi with = £/<, P; = 3^ and Q(y|it) = J2v \P(v\ u i v ) we conclude that I(U 2 ;Y 2 ) ^ (e, 1 — e). Since 
/(P 2 ; y 2 ) = j( 12 )(?) - /W(P), the claim follows. ■ 
Suppose P is a two-user q-ary input MAC. Let Bx,B 2 , . . . be an i.i.d. sequence of random variables taking values in the 
set {-,+}, with Pr(Pi = -) = Pr(Pi = +) = 1/2. Define a MAC-valued random process {P f : I > 0} via 

P :=P P e :=Pf_\,£>l. (10) 

Further define random processes {/j 1 ) : £ > 0}, {/j 2) : I > 0} and {/j 12 ) : I > 0} as 

P?W(i)(P), 4 2 ) : =/( 2 )(p), and 7i 12 ):=/( 12 )(P). (11) 

Lemma 12: The processes {/j 1 ) : £ > 0} and {/j 2 - 1 : > 0} are bounded supermartingales, the process {/j 12 ' : I > 0} is a 
bounded martingale. 

Proof: Since p> is a q-ary input MAC, /j 1 ' and /j 2 ' take values is [0, 1] and /j 12 - 1 takes values in [0,2], and thus the 
processes are bounded. The martingale claims follow from ([TJ, (|2]i and ^ respectively. ■ 
Theorem 13: The process (ijfi, l[ 2 \ lj> 12 ^) converges almost surely, and the limit 

(l£\lgKl£V):=lim(lP,4 2 \in 

i— J- OO 
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belongs to the set {(0,0,0), (0, 1, 1), (1,0, 1), (1, 1, 1), (1, 1,2)} with probability 1. 

Proof: Let (f2, Pr, P) be the probability space these processes are defined in. Let 

A:= \co £Q: lim li a) exists for each a £ {1. 2, 12}). 

The almost sure convergence of ij, 1 ) and follow from them being bounded supermartingales, almost sure convergence of 
/j 12 ) follows form it being a bounded martingale. Thus, Pr(^4) = 1, and it remains to show that the joint limit belongs to the 
set claimed. 

To that end we will first show that € {0,1} a.s. Since converges a.s., lim^_ i . 00 jljVj — = a.s. Since 
- tP\ is bounded (by 1), it follows that lim^.^ - = 0. But 

S[|7i|) 1 -4 1 )|]>I[/( 1 )(P+)-7( 1 )(P,)], 

and we see that lim^oo / (1) (P/) - I [1) {Pi) = 0. From Corollary [£] we conclude that lim^oo 1^ £ {0, 1}. 

Swapping the roles of the two users yields 7g) £ {0, 1} a.s. We thus find that {l£>,lg>) is equal to either (0,0), (0, 1), 
(1, 0), or (1, 1). Denoting the set of lo £ A for which l£) = a, = b by A ab , we see that A = A 0Q U A al U A w U A xl . 
Since 

ma X {/( 1 )j (2) }</ (12) </ (1) +/ (2) ! 

we conclude that the value of I^ 2 ** in Aqq, Aqi and A\q is 0, 1, and 1 respectively. 

All that remains now is to show that I^ 2 -* belongs to {1, 2} for lu £ An. To that end note that for any u) € An, 

(i) lim^ 00 /( 1 )(P,) = 1, lim^ 00 /( 2 )(P,) = 1, and 

(ii) lim^oc I (12) (P) exists and thus lim^ 00 |/( 12 )(P f+ i) - I (12) (P^)| = 0. 
But 

|/ (12) (P+i) - I^\P e )\ = /( 12 >(P+) - I (12) (P), 

and thus lim^oo /( 12 )(P + ) - /( 12 )(P £ ) = 0. Now Lemmapjjlets us conclude that lim^^ I {12 \Pt) £ {1, 2}. ■ 
Proof of Theoremwr When the processes Pi and Pf 1 , ^j 12 ')' ^ = 0, 1, . . . are defined as in ( fT0) i and ( fTTj i, respectively, 
we have 

Pr[d((4ViVi 12) ),M) > 5] = -#{se {-,+} f : d(/C(P s ),Af) > <5} . 
The claim then follows from Theorem Q~3] 



IV. Rate of polarization 

We have seen that any g-ary input MAC can be polarized to a set of five extremal MACs, by recursively applying the channel 
combining/splitting procedure of Section III Furthermore, Remark [6] suggests a natural scheme to exploit this phenomenon- 



polar coding [1]: one can hope to communicate reliably by sending uncoded information over the reliable channels, and not 
sending any information over the others. In this section, we will formalize this intuition, showing that such a coding scheme 
achieves points on the dominant face of 3(P). 

We first introduce some notation: Given a q-ary input multiple-access channel P, define two point-to-point channels 

P[U] : ¥ q -)• y and P[U \ V] : ¥ q -» y x ¥ q through 

piu]( y \u) = ±Y, p (y\ u > v ) 

V 

P[U\V](y,v\u) = \P(y\u,v) 

That is, P[U] is the channel U -> Y, and P[U | V] is the channel U YV. Define P[V] and P[V \ U] analogously. Also, 
for every a, 7 £ ¥ q define the channels P[a, 7] : ¥ q — >• y through 

P[a, 1 ](y\s) = \ p (v\ u ^) 

u.v. 

OtU-\-~fV=S 

That is, P[a, 7] is the channel all + jV — > Y. 

Given a point-to-point channel Q : ¥ q — > y, let P e (Q) denote its average probability of error with uniform input distribution 
and the optimal (ML) decision rule. Also let I(Q) denote the mutual information developed across Q with uniform inputs. 
That is, 
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Finally let Z(Q) define the Bhattacharyya parameter of Q, defined as 

Z (Q) = ^h) E E VWT^oUiW)- 

x^x' y€zB 

It is known (see [5]) that P e (Q) < qZ(Q). 

We are now ready to describe the encoding rule: Fix £ and let n — 2 . Let B n denote the n x n permutation matrix called the 
'bit reversal' operator in [1], and let G n = [ \ J]® denote the £th Kronecker power of the matrix [ \ \\. Put U n := (U\, . . . , U n ) 
and V n = (Vi, . . . , V n ) into one-to-one correspondence with X n = (Xi, . . . , X n ) and W n = (Wi, . . . , W n ) via 

X n = U n B n G n , 
W n = V n B n G n . 

Transmit (X n , W n ) over n independent uses of P and receive Y n . Defining Pr* : X x W — > y n x A" 1-1 x W 1_1 to be the 
channel U t V ->■ y^-i^-i, we see that 

P (1) is P ; 



P(2) is P" 
P(3) is P- 



P (n) is P+-++. 



It then follows from Theorem |5] that when n is large, almost all channels P^ are close to one of the five limiting channels. 
Also note that the ith channel assumes a genie that provides knowledge of the previous symbols (U' t ~ 1 V l ~ 1 ) at the receiver. 
This observation and Remark [6] motivate the following coding scheme: Fix e > 0, <5 > 0. Let Ajj C (U\,...,U n ) and 
Ay C (Vi, . . . , V n ) denote the sets of information symbols to be transmitted. Choose these sets as follows: 
(i) If ||/C(P (l) ) - (0,0,0)|| < e then U t $ AuM $ A v , 
(s.i) if ||/C(P W ) - (0, 1, 1)|| < e then U t £ A Vl V t e A v , 
(s.ii) if ||/C(P (4) ) - (1,0, 1)|| < e then U l G A v ,Vi £ A v , 

(s.iii) if ||/C(P W ) - (1, 1, 1)|| < e then either {7, e ^c/, V l £ A v , or ^ £ Au,Vi G -4 V , 
(s.iv) if ||/C(P (<) ) - (1, 1, 2)|| < e then C/ 4 e ^y, V5 e 
(s.v) otherwise, f/j ^ Au,Vi £ Ay- 
Choose the symbols in .4^ and yly independently and uniformly at random, and reveal their values to the receiver. This 
choice of Ajj and Ay ensures that all the information symbols see 'reliable' channels, provided that the previous symbols are 
decoded correctly. Consequently, upon receiving Y", the receiver may attempt to decode the symbols successively, in the order 
(UiVi), (U2V2) . . . , and hope for a low block error probability. Furthermore, Theorem [5] and the preservation of I^ 12 \P) 
throughout the recursive channel splitting/combining process guarantee that for any choice of e and 5 there exists no such that 
\Au\ + \A V \> n[lW(P)-S\ whenever n> hq. This observation hints at the achievability of points on the dominant face of 
3(P). For a proof of achievability, it only remains to show that the block error probability of the discussed scheme vanishes 
with increasing block length. We do this next. 

Let ^ : y n x F* _1 x F* _1 , i=l,...,n denote the ML decision rule for estimating (/T^) given (Y n , {UV)^ 1 ). Note that 
this corresponds to a genie aided decision rule — the genie provides (UV)' l ~ l — for estimating (UiVi) from the output Y n . Let 
Ei denote the event 0j(F™, (UV) 1 ^ 1 ) 7^ (UiVi). Observe that Ei is precisely the error event of P^y Now define a standalone 
decoder, recursively through 

Ti = (j>i(Y n 1 T i ~ 1 ), i=l,...,n, 
and let E[ denote the event Ti ^ [UiVi). Note that ^JiE[ is the block error event for the scheme discussed above, and that 

Ui-Ej - Ui-Ej. 

Hence, the block error probability can be bounded as 

Prfblock error] = Pr[U^] - Pr^P,] < E Pr I^] = E P ^ P (^- ^ 14 > 

i i 

Note that the transmission scheme described above implies that the only non-zero error terms on the right-hand-side of ( fl4| ) 
are those corresponding to the symbols in Ajj and Ay. We will show that almost all of these terms are sufficiently small, i.e., 
that by removing a negligible fraction of information bits from Ajj and Ay, the above sum can be made to vanish. 

Theorem 15: For any f3 < 1/2, the block error probability of the polar coding scheme described above, under successive 
cancellation decoding, is o(2 - ™' 3 ). 



Theorem 15 is an immediate corollary to the following result. 
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''■ ||/C(P S ) -(0,1, 1)11 <e,P e (P s [V])>2- 21 
||/C(P S ) -(1,0, 1)11 <e,P e (P s [£/]) >2" 2 ^ 



Lemma 16: For any e > and /3 < 1/2 
(r.l) lim^oo ^#|s € {-,+} 
(r.2) lim^oo ^#|s e 
(r.3) lim^oo ^# js € 
(r.4) lim^^ ^#{s€{-,l}' 

The following proposition will be useful in the proof of Lemma |T6] 
Proposition 17: For all a, 7 e F 9 and J > 0, 

1 



= 0, 
= 0, 



|/C(P S ) - (1,1,1)|| < e,max{P e (P s [C7 | U]),P e (P s [y | U})} > 2^} =0, 
|/C(P S ) - (1, 1, 2)|| < e, P e (P s [U}) + P e (P s [V}) > 2- 2 " 3 } - 0. 



lim #{se{-,+^:/(P s [a, 7 ])e (S,l-S)} = 0. 



- . • 1 - ' \ ,.. , . (18) 

That is, the channels a[7 + 7 F — > Y polarize to become either perfect or useless. Moreover, convergence to perfect channels 
is almost surely fast: 



Hm ~#{s G {-,+}*: 7(P>, 7 ]) > 1 - 6, Z(P*[a, 7]) > 2"""} 







(19) 



for all < /?, <5 < 1/2 and a, 7 e F 9 . 

Proof: See Appendix B. ■ 
Proof of Lemma \16\ The proof of (r.l) follows immediately from Proposition 17 by taking a — and 7 = 1, and by the 

relation P e (P s [V}) < qZ(P s [V}). Proofs of (r.2) and (r.4) follow similarly. 
To prove (r.3), we first observe that for any MAC P, 



max{P e (P[t/ I V]),P e (P[V I U})} < P e (P[a,j]) < qZ{P[a,j\) 



for all a, 7 € F g . We also know from Proposition 17 that when £ is sufficiently large, then for all a, 7 € F g , 

/(P>, 7 ])£( (e),l- (e)) 



(20) 



(21) 



for almost all s e { — , It is an immediate consequence of Lemma 33 in Appendix C that whenever ( f2T| is satisfied, then 

||/C(P S ) - (1,1,1)|| < e implies I(P[a,7]) > 1 - o(e) 
for some a, 7. Claim (r.3) will then follow from ^9} and pO) . 



V. Discussion 

The technique described above adapts the single-user polarization technique of Ankan to the two-user multiple-access 
channels. It can be seen that it retains the quality of being low complexity, and has similar error probability scaling as the 
single-user case. 

As in the original polar code construction for single user channels, the discussion for MAC polar codes above consider 
uniform input distributions. How to achieve true channel capacity with polar codes, using Gallager's method [8, p. 208], is 
discussed in [5]. The arguments in [5, Section III.D] can easily be adapted to multiple-access channels to extend the above 
results to rate regions with non-uniform inputs. 

A number of questions for further study come to mind: Unlike the single-user setting, where the 'symmetric capacity' of 
a channel is a single number, the 'dominant face' of the set of rates that correspond to uniformly distributed inputs is a line 
segment. The polarization technique outlined here does not in general achieve the whole segment, but only a subset of it, for 
the simple reason that the equations (1) and (2) are inequalities rather than equalities. Is there an alternative way to do MAC 
polarizaton and not suffer this loss? 

A natural extension of the results presented is to the case of multiple-access channels with more than two users. For such 
channels, one can fairly easily show that with a similar construction as in this paper, there are a finite number of limiting 
MACs, and that these extremal MACs have the property that their rate regions are described by polymatroidal equations with 
integer right-hand sides, and are thus matroids. One encounters, however, a new phenomenon: not all matroids are possible 
regions of a MAC. The treatment of these require further techniques, which is the subject of [7]. 



Appendix A 

In this section, we discuss how polar codes can be used to achieve arbitrary points in the capacity region of any MAC with 
arbitrary number of users and discrete input alphabets. We follow the notation used in Section |ll] For sake of simplicity, we 
show the achievability of corner points of 3(P) for a given g-ary input two-user MAC P, and discuss how the result can be 
generalized. 
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Theorem 22: Let P be a two-user g-ary input MAC. For any e > and j3 < 1/2, there exist two polar codes C\ and C 2 
with sufficiently large block lengths n, and with rates 

Ri >I(X;Y)-e 
R 2 >I(W;YX)-e 



such that if used by the two senders for transmission over P, their average block error probability does not exceed 2~" . This 
performance is guaranteed under a receiver that decodes the messages successively. 

Proof: Given a single-user g-ary input channel Q, let P ej „(Q, A, uj^) denote the block error probability of a polar 
code under successive cancellation (SC) decoding, with information set A and frozen symbols fixed to u A a, averaged over all 
messages. We know from [1] and [5] that when n is sufficiently large, there exists a set A with |.4| > n(I(Q) — e) and 

1 ^- l J2PeAQ,A,u A c)=0(2- nfi ). (23) 

Define two g-ary input channels Qi : F 2 — > y and Q 2 : F 2 — > y through the transition probabilities 

Qi(i/|aO = !X^(vl*.«0> 

w 

Q 2 (y,x I w) = iP(y I x,w). 

Clearly, we have I(Qi) — I{X;Y) and I(Q 2 ) — I(W;YX). Take n sufficiently large and find sets Ai and A 2 with 
|^4i| > n(J(Qi) - e) and \A 2 \ > n(I(Q 2 ) - e), such that (f23]l holds when (Q,A) is replaced with {Qi,Ai) and (Q 2 ,^2), 
respectively. We will show that the ensemble of polar code pairs characterized by A\ and _4 2 nave small average error 
probability when used for transmission over P. 

Consider a receiver that first makes an SC estimate X n = <fix(Y n ) on the first sender's codeword X n based on the output 
Y n , and then produces W n = <j)w(Y n X n ), where 4>w denotes the SC estimate of W n conditioned on (Y n X n ). That is, the 
decoder for W n assumes that the decision on X n is always correct. The average block error probability of this scheme can 
be bounded using the relations 

Pr [block error] = Pr[X n ^ X n or W n ^ W n ] 

= Pr[0x(^) ^ X n ] + Fv[<t>w{Y n X n ) ^ W n ,X n = X n ] 

= Fr[(t>x{Y) ^ X n ] + Pr[(t) W (Y n X n ) £ W n ,X n = X n ] 

< Pr[cb x (Y) ? X n ] + Pr[<My"X") W n ]. 

The first probability term above can be written as 

Pr[<MY n ) * * n ] = \ E F ^x(Y n ) ^ X n \W n = w n ] 

= C(2-™' 3 ). 

Here, we obtained the second equality by observing that the codeword symbols X n and W n are independent and uniformly 
distributed, which follows from the uniform distribution on the frozen and information symbols. The third inequality follows 
from ( p3| . By the same line of argument one can write 

Vi[<f) W {Y n X n ) £ W n ] = Y, F ^w{Y n X n ) y£ W?,X? = sf] 

x i 

r^TT Pe,n(Q2, A 2 ,U A c) 



U AC 

9, 



= 0(2'" ). 

Therefore, the block error probability, averaged over the ensemble of polar code pairs is 0{2~ nt> ). This lets us conclude that 
there exists at least one pair of polar codes with the promised rates and average block error probability. ■ 
In [2] and [3], it was shown that any point in the capacity region of an A/-user MAC can be expressed as a corner point of 
(at most) a (2M — l)-user MAC rate region, possibly with non-uniform inputs. In addition, it is shown in [5, Section III] how 
polar codes for non-binary channels can be used to achieve capacity of arbitrary discrete channels, by inducing arbitrary non- 



uniform distributions on the input. Modifying the above proof along these observations, one can easily generalize Theorem 22 
in order to show that polar codes achieve all points in the capacity region of any discrete input MAC with arbitrary number 
of users. 
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Appendix B: Proof of Proposition[T71 
Given a channel Q : ¥ q y, define two channels Q b : ¥ q ->• ^ 2 and Q 5 : ¥ q -> y 2 x F 9 through 

Q 6 (2/i>I/2 I ^l) = ^ |Q(s/i I ^l + x 2 )Q(y 2 \ x 2 ), 

Q 9 (2/1) 2/2, sci I is) = |Q(3/i I »i + x 2 )Q{y 2 \ x 2 ). 

It is easy to see that 7(Q b ) + J(Qfl) = 2J(Q). We will show that for all 0,76 F 9 , 

(i) P[a,j] 9 is degraded with respect to P + [a,7], 

(ii) P[a,j) b is equivalent to P + [a, 7], 
implying 

/(P+[a,7]) + /(P-[a ! 7])>2/(P[a,7])- 

This, in addition to (i), (ii), and Lemma |7j implies the convergence of the channels P[a, 7] to extremals — the proof is identical 
to that of Corollary [H] That is, 

Urn ^# {s G {-, +} e : I(P s [a n ]) G (<S, 1 - <5)} = 0. 
To prove the claim on the rate of convergence, we will show that 

Z(p-[a,i\)<2Z{P[a,j\) and Z(P+[a, 7]) < gZ(P[a, 7 ]) 2 - (24) 

The proof will then follow from previous results, namely 

Lemma 25 ([4], [5]): For any g-ary input channel Q: ¥ 2 — > y, channels Q b and Q 9 satisfy 

Z(Q b ) < 2Z{Q) and Z{Q 9 ) < qZ{Q) 2 . (26) 

In particular, this implies that 

Hm ^# {s G {5, b} e : I(Q S ) > 1 - e, Z(Q S ) > 2" 2 " } = (27) 

for all < e,(3 < 1/2. 
It thus remains to show (i), (ii), and ( |2"4| ). 
Proof of (i): We have by definition 

P + [a, 7] (2/1) 2/2,^1,^1 I s) = -P + (2/i)2/2,wi,wi I u 2 ,u 2 ) 

= X! ■ q \P(yi\ui+u 2 ,v 1 +v 2 )P(y 2 \u 2 ,v 2 ). (28) 

112,1)2: 

Q«2+7112=S 

On the other hand, 

P[a,j] 9 (yi,y2,x I s) = ip[a,7](yi | a; + s)P[a,j](y 2 \ s) 

= -q\ P(yi\u 1 ,v 1 )P(y 2 \u 2 ,v 2 ) 

u 1 ,v 1 ,U2,v 2 : 
aui+jvi—x+s 
au 2 +-fv 2 = s 

Since the constraints au\ + jVi = x + s and au 2 + jv 2 — s are linear, the above sum can be rewritten as 

P[a,^] 9 (yi,y 2 ,u 1 ,v 1 I s) = 4j ^2 P(yi\ui+u 2 ,v 1 +v 2 )P(y 2 \u 2 ,v 2 ). (29) 

ux, 111, 112,112: 

Qlil+7Ul=X 
Q!U2+7ll2 = S 

Comparing ( |28] i and ( |29] i, we observe that the channel P[a, 7p is obtained by processing the output (Yi,Y 2 , U\, Vi) of P + [a, 7] 
to retain (Yi, cct/i + 7V1). This completes the proof. 
Proof of (ii): We have 

^~ [a, 7] (2/1)2/2) I x) = V ~P~{yx,V2 I «i,«i) 

Ul,lii: * 
awi+7iii=x 

= J! ^P(2/i I "1 + "2, wi + v 2 )P(y 2 I tia,«a). (30) 

OtU\-\-^V\=X 
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On the other hand, 

P[u,j] b (yuy2 | x) = i^P[a, 7 ](yi | x + s)P[a,j\(y 2 \ s) 

S 

E Y^P{yi\ui,v 1 )P(y 2 \u2,v 2 ) 



u 1 ,v 1 ,u 2 ,v 2 : s 
au 2 +^v 2 =s 



As in the proof of (i), we can rewrite the above sum as 

P[0t,i\ b {yi,V2,Ul,Vl | S) = Y YjsP{yi\u 1 +U2,Vi+V 2 )P{y2\u2,V2) 



u 1 ,v 1 ,u 2 ,v 2 : s 
awi+7ui- x 
au 2 +~jv 2 =a 



£ ^P{yi\ui+U2,Vi+V 2 )P{y2\u2,V2). (31) 
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u 1 ,v 1 ,u 2 ,v 2 : 
CtU\+^V\=X 



Comparing ( f30] > and pT) we conclude that P[a,j] b and P [a, 7] are equivalent. 

Proof of {24) : It immediately follows from ( p6| ) and (ii) that Z(P b [a, 7]) < 2Z(P[a, 7]). In order to complete the proof, 
we will show that Z(P+[a,j\) < Z(P[a^] 9 ). It will then follow from (f26]l that Z(P+[a,j]) < qZ(P[a,~f}) 2 . 
Define the channels 

P^vA^lMv^y^ I s ) = 9 2p+ K 7] 0/1:2/2, Ml, W2 I s), 
^[a,7] 9 (yi,2/2 I s) = qP[a,j} 9 (y 1 ,y 2 ,x \ s). 

An inspection of |28) and ( |29| ) reveals that 

^[a, 7 ] 9 (yi,y 2 I a) = E 7] I s). 



aui+jv-i—x 



Also, we clearly have 



Z(P+[a, 7 }) = ^ ZiP+^la^}), 

ui,tii 



It then follows from the concavity of the Bhattacharyya parameter in the channel (cf. Lemma 32 below) that 

X 

= \zZ z { E 

X \Qlli +7'Ul — X / 

>?E E ^(^Kt]) 

x aui-^-yvi—x 

= 7]) 

= Z(P+[a, 7 ]), 

completing the proof. 

Lemma 32: Let Q,Q%, . . . , Qk be g-ary input channels with 



A' 

Q = E Pfc( 5 fc > 



where p/t > and J2kP k = Then, 



fc=i 



A' 



fe=i 
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Proof: The proof is identical to that of [1, Lemma 4]: 

1 



Z{Q) = 



> 



9-1 
1 



V V x / 

- 1 + ^EE^ (e I x)) 

V k \ x / 



9-1 

k 

Here, the inequality follows from [8, p. 524, ineq. (h)]. ■ 

Appendix C 

Lemma 33: Let X, W £ ¥ q be independent and uniformly distributed random variables, and let Y be an arbitrary random 
variable. For every e > 0, there exists S > such that 

(i) I(X;Y) < 5, I(W;Y) < 5, H(X \ YW) < 5, H{W \ YX) < 5 and 

(ii) H(aX + jW | Y) (5, 1 - 5) for all aje F„ 
implies 

I{a'X + iW;Y)>l-e. 



for some a', 7' £ ¥ q . 

Proof: Let ir be a permutation on F 9 , and let 



i if w = n(x) 
otherwise 



Note that H(X) = H(W) = 1 and H(W \ X) = H(X \ W) = whenever (X, is distributed as Pjr . We claim that for 
every ir, there exist 0:^,7^ € F q \{0} such that 

H(a w X + lv W) <l-c(q), 

where c(q) > depends only on q. To see this, given a permutation ir, let 

a* :=7r(0)-7r(l), 7^=1, /* := tt(0). (34) 

Clearly, a w 7^ 0. It is also easy to check that with these definitions we have 

Pv[a v X + l7r W = fj]> Pr[(X, W) = (0, tt(0))] + Pr[(X, W) = (1, tt(1))] = §, 

which yields the claim. It also follows from the continuity of entropy in the L\ metric that 

\\pxw -p w ||<e implies %X + y n W) < (1 - c(g))(l - 0(e))- 1 . 

We now show that for every e > 0, there exists a <5 > such that whenever H(W\YX) < 5, H(X\YW) < 5, I(W; Y) < 5 
and I(X; Y) < 5, then there is a set S of y's with p Y (S) > I — e such that for all y £ S 

min\\pxw\Y= y ~ Pk\\i < e- 

From /(W; V) < 5, Pinsker's inequality yields 

-Pw\Y= y \\ i < V2<fln2 < 2 A 

y 

and we conclude that the set 

G:={y:||i- P w|Y=Ji<<5 1/4 } 
has probability at least 1 - 25 1 / 4 . Note that for y £ G, 

\-\5 1/i <Pw\y= v m<\ + \6^ 

\-\6 1/A <Pw\Y=,m<\ + \* llA 

and thus 

(1 - 5V*)Mv) < PY\w(y\0) < (1 + <5 1/4 W(2/) 
(1 - 5 1/4 )pr(t/) < py\ w (y\l) < (1 + 5 1/4 K(2/) 
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Furthermore, as 

6 > H(X\WY) = Y^Pwy{w, y)H(X\W = w,Y = y), 

w,y 

the set {(w,y): H(X\W — w,Y = y) > yd} has probability at most y/6. Let 

B w = {y: H{X\W = w,Y = y) > VS}, ki£F, and B = U W B W . 

Then, 

Py(GnB w ) = MV) 

< [i - ^-i E PY\wiy\w) 
^[l-s^r 1 E py\w(v\w) 

< [i-^/4]-i2 E 

yeB w 

for all w G F„, and thus 

Py(GnB) < 2gV5[l - 5 1 / 4 ]- 1 , 

and the set 5 = G n £> c has probability 

iV(5) > 1 - 25 1/4 - 2qVd/[l - 6 1/4 } = 1 - o(S). 
Note that for all y g S we have for any w, \~ — Pw\Y=y( w )\ < °(^)> an d Px|vky (^1^, y) ^ (o(<$), 1 — o(<5)), and thus 

min\\pwx\Y= y ~pA < o(5). 
In particular, this implies that there exist tt' and S' C S with Py(S') > Py(i5')/g! such that 

lbwx|y= a -fe'll < o(S) 
for all y E S 1 . Letting a 1 = a n t and 7' = 7^/, where av and 7^/ are defined as in ( |34) , we obtain 

(a'X + 7 'W I y) < Fy(5')(l - c(g))(l - 0(e))- 1 + P Y (S' C ) 
= (l- C2 )(l-o(6))- 1 

where c 2 > depends only on q. Noting that I(a'X + j'W; Y) ^ (6,1 — 5) by assumption, and that 

I(a'X + iW\ Y) = H(a'X + j'W) - H(a'X + 7'VK | Y) 
>l-(l-c 2 )(l-o( e ))-\ 

we see that if 6 is sufficiently small, then I(a'X + rfW] Y) > 1 — 6. ■ 
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